Pseudo- Power Avoidance 



Ehsan Chiniforooshan Lila Kari Zhi Xu 

The University of Western Ontario, 
Department of Computer Science, 
^ ! Middlesex College, London, Ontario, Canada, N6A 5B7 

{ehsan, lila, zhi_xu}@csd.uwo . ca 

November 13, 2009 



Abstract 

J. 

ry j Repetition avoidance has been studied since Thue's work. In this paper, we consid- 

ered another type of repetition, which is called pseudo-power. This concept is inspired 
by Watson-Crick complementarity in DNA sequence and is defined over an antimorphic 
involution (p. We first classify the alphabet £ and the antimorphic involution (f>, un- 
der which there exists sufficiently long pseudo-fcth-power-free words. Then we present 
algorithms to test whether a finite word w is pseudo-fcth-power-free. 
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^ ! 1 Introduction 

Let £ be an alphabet. The set of finite words and infinite words over E are denoted by X* 
and S w , respectively. Word v is called a factor of x if x — uvw for some words u and w. A 
nonempty word w is called a square if w can be written as w = uu for some «eE', and is 
called a cube if w can be written as w = uuu for some u G X*. For example, the English word 
"murmur" is a square. More generally, for an integer k > 2, a nonempty word w is called a 
kth power if w = u k for some hgE*. A word w is called square-free (respectively, cube-free, 
kth-power-free) if w does not contain any square (respectively, cube, kth power) as a factor. 
In the early 1900 's, Thue showed in a series of papers examples of square- free infinite words 
over 3 letters and 4 letters respectively and cube-free infinite word over binary alphabet 
[201 [21] ( see [2] for English translation of Thue's work). In 1921, Morse [17] independently 
discovered Thue's construction. In 1957, Leech [H] showed another construction of square- 
free infinite word, which is generated by morphism. In 1995, Yu [22J showed square-free 
infinite words that cannot be generated by any morphism. 

Let | w \ a be the total number of letter a that appear in the word w, and let | w \ = 
l^aes I w L f° r w e A nonempty word w is called an abelian square if w is in the 
form w = U1U2 such that | u\ \ — \ U2 L for each letter a. For example, the English word 
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"teammate" is a abelian square. Analogously, wE* is called an abelian cube if w — U1U2U3, 
where | U\ \ = \ u 2 \ = | M3 | for each a G S, and called an abelian kth power if w — 
U\U 2 ■ ■ ■ Uk, where integer k > 2, | Ui \ a — \ Uj | for a G £ and 1 < i, j < k. A word w is called 
abelian- square- free (respectively, abelian- cube- free, abelian-kth-power-free) if w contains no 
abelian square (respectively, abelian cube, abelian kth power) as a factor. In 1957, Erdos 
[7] asked whether there exists an abelian-square-free infinite word. The construction of such 
words were given by Pleasants [18] in 1970 over 5 letters and by Keranen [TU] in 1992 over 4 
letters. Most recently, Keranen [TT] presented a great many new abelian-square-free infinite 
words. In 1979, Dekking [6] discussed abelian- fcth-power-free infinite words for k > 3. 



The discussion on kth power is related to biology. Repeats of certain segments in human 
DNA sequences may predict certain disease [16]. A variation on the kth power, called 
pseudo kth power, is defined by antimorphic involutions, which are generalizations of the 
famous Watson-Crick complementarity function in biology. Other concepts in combinatorics 
on words have been generalized in the setting of antimorphic involutions, such as pseudo- 
primitivity [5] and pseudo-palindrome \9\. A variation on the concept of the pseudo kth 
power has also appeared in tiling problems (see [H [3]), where the function involved is a 
morphism other than an antimorphism. 

In this paper, we will discuss the word that does not contain any pseudo kth power 
as a factor. Here the pseudo power is defined by antimorphic involutions. In Section [21 
we will introduce the concept of pseudo-power-free words and discuss the existence of such 
words over different setting of alphabet and antimorphic involutions. In Section [31 we will 
discuss the algorithms for the decision problem of pseudo-power-freeness. At the end, we 
will summarize the results and present open problems. 

2 Pseudo-power-free infinite words 

Without loss of generality, in the following discussion we always assume the letters are 
0, 1, 2, . . .. The empty word is denoted by e. A function 9 : S* — > S* is called an involution 
if 6{6{w)) = w for all w G £*, and called an antimorphism (respectively, morphism) if 
6(uv) = 6(v)8(u) (respectively, 8(uv) = 8(u)6(v)). We call 6 an antimorphic involution 
if 9 is both an involution and an antimorphism. For example, the classic Watson-Crick 
complementarity in biology is an antimorphic involution over four letters. The mirror image 
is the function of reversing a word as defined by Mir(aia2 ■ • • a n ) = a n ■ ■ ■ a 2 a\. Then the 
mirror image over any alphabet is also an antimorphic involution. A transposition (a, b) is 
a morphism that is defined by (a, b)(a) = b, (a, b) (b) = a, (a, 6)(c) = c for c 7^ a, c 7^ b, a < 
b, a, b, c G E. One can verify that the mirror image and any transposition commute, and two 
transpositions (fli,a 2 ) and (bi,b 2 ) commute for a t 7^ bj,i,j G {1,2}. By definitions, every 
antimorphic involution on the alphabet is a permutation of the letters. Furthermore, we 
have the following proposition. 

Proposition 1. Let 9 be an antimorphic involution over the alphabet S. Then 9 can be 
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uniquely written as the composition of transpositions with a mirror image 

6 = (ao, ai) • (a 2 , a 3 ) • • • (a 2m -2, a 2m -i) • Mir (1) 

up to changing the order of composition, where a^ ^ dj for i ^ j and m > 0. 

Proof. First we show that 6(a) is a single letter for any letter a. By definitions, 6(w) = 
6(ew) = 6(e)6(w), so 6(e) = e. For any a G E, since 6(6(a)) = a, we have | 6(a) \ > and 
| 6(a) | < 2. Hence | 6(a) | = 1 for any a G E. 

Now we prove the existence of the decomposition in Eq. (OQ) for by induction on the 
size of the alphabet. If | E | = 1, then 5(0) = and 6(0 rn ) = m = Mir(0 m ). So 6 = Mir 
and Eq. (P holds. If | E | = 2, then either 0(0) = or 0(0) = 1. One can verify that 
= Mir when 0(0) = and = (0, 1) ■ Mir when 0(0) = 1. Suppose Eq. CQ) holds for 
any | E | < n. For | E | = n > 3, either 0(0) = or 0(0) = a for some a ^ 0, a G E. 
If 0(0) = 0, then by induction hypothesis the restriction of on alphabet E \ {0} can be 
written as (a , a x ) • • ■ (a 2m _ 2 , a 2m -i) • Mir. So = (a , ai) • • ■ (a 2m _ 2 , a 2m -i) • Mir in this case. 
If 0(0) = a, then 6(a) = 0. By induction hypothesis, the restriction of on alphabet S\{0, a} 
can be written as (a , ai) • • • (a 2m _ 2 , a 2m _i)-Mir. So = (0, a)-(a , ai) • • • (a 2m _ 2 , a 2m _i)-Mir. 
Therefore, Eq. (CQ) holds. 

To show the uniqueness of the form in Eq. (CQ), we first notice that every transposition is 
the inverse of itself. Assume there is another decomposition = (bo, bi)-(b 2 , b 3 ) ■ ■ ■ (6 2n _ 2 , b 2n -i) 
Mir, where bi ^ bj for i ^ j and n > 0. Then 6(b ) = bi and thus there is some 
(a p ,a p+ i) = (b ,bi). Since aj ^ aj for i ^ j, the order of composition in Eq. ([1]) can be 
arbitrarily changed. Hence we have 

(b 2 ,b 3 ) ■ ■ ■ (6 2 „_ 2 , 6 2ri _i) ■ Mir 
=(60,60-0 

= (a , ai) ■ ■ ■ (a p _ 2 , a p _i) • (a p+2 , a p+ z) ■ ■ ■ (a 2m _ 2 , a 2m _i) • Mir. 

Continuing this procedure, it follows that Mir = (a' , a[) ■ ■ ■ (a f 2m ,_ 2 , a' 2m ,_ 1 ) - Mir. By the con- 
struction of {a,}, however, we know a- ^ a'j for i ^ j. So m! = 0, since otherwise Mir(a ) = 
a[ ^ a' . Therefore, m = n and {(6 , h),..., (6 2n _ 2 , 6 2 „_i)} = {(a , ax),..., (a 2m _ 2 , a 2m -i)}- 

□ 

For any antimorphic involution over E, we define 

ldt(0) = {a G E : 6(a) = a}, Trn(0) = {a G E : 6(a) > a}. 
The following proposition follows directly from Proposition [Q 

Proposition 2. Let 6 be an antimorphic involution over the alphabet E. Then 6 can be 
written as the composition of \ Trn(0) | distinct transpositions with a mirror image, and 

|ldt(0) | +2|Trn(0) | = |E|. (2) 
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Proof. By the proof of Proposition!]], 9 can be written as 9 = (a , ai)-(a 2 , 0,3) •• ■ (tf2m-2, a>2m-i) 
Mir, where a, 7^ a,,- for i 7^ j. Then Idt(#) = E\ {ao, fli, • • • , 02m-i} an d Trn(#) = 
{a , 02, ■■■ , ct2m-2}- So m = I Trn(0) | and Eq. ([2]) holds. □ 

For integer k and antimorphism 9, we call word w a pseudo kth power (with respect to #) 
if w can be written as w = U\U 2 ■ ■ -u^, where either U; L = Uj or Ui = 9(uj) for 1 < i, j < k. In 
particular, we call pseudo 2nd power by pseudo square, and pseudo 3rd power by pseudo cube. 
For example, over the alphabet {A, T, C, G} (here we use the conventional symbols instead of 
{0, 1, 2, 3}) with respect to the Watson-Crick complementarity (A 1— > T, T 1— > A, C 1— > G, G 1— > C), 
ACGCGT is a pseudo square and ACGTAC is a pseudo cube. By definitions, every pseudo kth 
power is an abelian kth power with respect to the mirror image, and every kth power is a 
pseudo kth power (with respect to any antimorphic involution on the same alphabet). A 
word w is called pseudo-kth-power-free (respectively, pseudo-square-free, pseudo-cube-free) if 
w cannot be written as w = uvx where v is a pseudo kth power (respectively, pseudo square, 
pseudo cube). 

In the remaining of this section, we will discuss the following problem: is there a pseudo- 
/cth-power-free infinite word over £ with respect to 91 The discussion on pseudo-power-free 
words is related to power-free words and abelian-power-free words in the sense of the following 
lemmas. 

Lemma 3. // 1 is the minimal size of alphabet over which there is a kth-power-free infinite 
word, then 

(1) there is no pseudo-kth-power-free infinite word over I — 1 or less letters; and 

(2) there is a pseudo-kth-power-free infinite word over V letters with respect to 9, where 
|Trn(0) \>l. 

Proof. (1) If Z is the minimal size of alphabet over which there is a fcth-power-free infinite 
word, then for any I" < I there is an integer iV such that any word of length greater than TV 
over I" letters contains a /cth-power. Since a fcth-power is a pseudo-fcth-power (with respect 
to any antimorphic involution), any word of length greater than iV over I" letters contains a 
pseudo-fcth-power with respect to 9. 

(2) Let 9 be an antimorphic involution on /' letters such that | Trn(#) | > /. We choose 
£' C Trn(#) such that | £' | = I. Then there is an infinite word w over £' such that w is /cth- 
power-free. Now we claim that w is also pseudo-fcth-power-free over /' letters with respect to 
9. Suppose w contains a pseudo-fcth-power. Then w = xuiu 2 ■ ■ ■ u^y, where either U; L = Uj or 
Ui = 9(uj) for 1 < i,j < k. For any a G by definition, 9(a) > a and 9(9(a)) = a < 9(a). 
So 9(a) $l £' and thus 9(ui) is not a word over £' for all 1 < i < k. Hence Ui = Uj for 
1 < hj < k and Uiu 2 ---Uk is a normal fcth-power, which contradicts the fact that w is 
A;th-power-free. Therefore, w is pseudo-fcth-power-free with respect to 9. □ 

Lemma 4. // there is an abelian-kth-power-free infinite word over I letters, then there is a 
pseudo-kth-power-free infinite word over 21 — 1 letters with respect to arbitrary antimorphic 
involution 9. 
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Proof. By Eq. P) in Proposition [2], it follows that | Trn(0) | < I and thus | ldt(0) | + | Trn(0) | > 
/. We choose E' C ldt(0) U Trn(0) such that | E' | = Z. Then there is an infinite word w over 
E' such that w is abelian-/cth-power-free. Now we claim that w is also pseudo-Zcth-power-free 
over 21 — 1 letters with respect to 8. 

Suppose w = xu\U2 • ■ ■ Uky contains a pseudo-fcth-power, where either u,i = ui or Ui = 
9{ui) for 1 < i < k. Then either u\ is a word over ldt(0) or u\ contains at least one 
letter from Trn(0). If ui G ldt(8)*, then 8(ui) = Mir(ui). So u\Ui ■ ■ - u^ is an abelian /cth 
power, which contradicts the fact that w is abelian- /cth-power-free. Otherwise, U\ contains 
at least one letter from Trn(0), say a. Then since 8(a) > a and 8(8(a)) = a < 8(a), we have 
8(a) G" E' C ldt(0) U Trn(0). So U; L = U\ for 1 < i < k, and thus w contains a kth power, 
which again contradicts the fact that w is abelian-/cth-power-free. 

Therefore, w is pseudo-/cth-power-free with respect to 8. □ 

Lemma 5. // there is a pseudo-kth-power-free infinite word over E with respect to 8, then 
there is a pseudo-kth-power-free infinite word over E' with respect to 8' , where \ Trn(#') | > 
| Trn(#) | and | ldt(0') | + | Trn(^) | > |ldt(0) | + | Trn(^) |. 

Proof. We choose Ex C Trn(^) such that | E x | = | Trn(^) |. Since | Idt(^) | + | Trn(^) | - 
| Tm(8) | > | ldt(0) |, we can choose E 2 C ldt(8') U Trn(0') \ Ex such that | E 2 | = | ldt(8) \. 
Define E" = E x U {8' (a) : a G SjU E 2 , and define antimorphic involution 8" by 8" (a) = 
a, 8"(b) = 8'(b) for a G E 2 , b e E x U {8'(a) : a G E x }. Then | E" | = | E | , \dt(8") = \ E 2 | = 
| \dt(8) | and Trn(0") = | Ei | = | Trn(0) |. So 8 and 8" are identical up to renaming of the 
letters. There is a word w over E" such that w is pseudo- /cth-power-free with respect to 8". 
We claim w is also pseudo-/cth-power-free over E' with respect to 8'. 

Suppose w = xu\U2 ■ ■ ■ Uky contains a pseudo-Zcth-power, where either = Ui or Ui = 
8'(ui) for 1 < % < k. Then either U\ is a word over Ex U {8'(a) : a G Ex} U (E 2 fl ldt(8')) or U\ 
contains at least one letter from E 2 fl (Trn(0') \ Ei). In the former case, 8"(ui) = d'(u\) and 
thus w contains a pseudo- fcth-power with respect to 8", which contradicts to the pseudo-fcth- 
power-freeness of w. In the latter case, we assume Ui contains a G E 2 fl (Trn(0') \ Ex). One 
can verify that 8'(a) G^ E", so Ui ^ 8'(u\) for all 1 < % < k. Hence w contains a /cth-power 
U\U2 ■ ■ -Uk, which again contradicts the pseudo-/cth-power-freeness of w. 

Therefore, w is pseudo-/cth-power-free over E' with respect to 8'. □ 

2.1 Pseudo-square-free infinite words 

First we consider pseudo-square-free infinite words. Since every binary word of length greater 
than 3 contains squares, there is no square-free infinite word over 2 letters. By Keranen's con- 
struction of abelian-square-free infinite words, there exist pseudo-square-free infinite words 
over 4 letters with respect to the mirror image. Furthermore, we have the following result. 

Proposition 6. For ^-letter- alphabet, pseudo-square-free infinite word exists with respect to 
mirror image and does not exist with any other antimorphic involution. 

Proof. There are two kinds of antimorphic involutions over 3 letters: 8 is either the mirror 
image or a transposition composed with the mirror image. 
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Suppose 9 is the mirror image. Then the following morphism I given by Leech [T3] 
preserves square-freeness and presents an infinite word 1^(0), which is pseudo-square-free 
with respect to the mirror image: 

1(0) =0121021201210, 
1(1) =1202102012021, 
1(2) =2010210120102. 

To see w = l u (0) contains no pseudo square, first we observe that w contains no square. If 
w contains pseudo square of the form U1U2 such that U\ = 0(v,2), then U\Ui = ufu2 contains 
a square of length 2 in the middle. Since w is square free, w does not contain pseudo square. 

Now suppose 9 is a transposition composed with the mirror image. Without loss of 
generality, we assume 6 = (0, 1) ■ Mir. We prove no pseudo-square-free infinite word exists 
in this setting. Suppose w is a pseudo-square-free infinite word. A pseudo-square-free word 
(with respect to 9) cannot contains 00, 11, 22, 01, 10, 2012, 2102. So w is either in (02+ 12)" 
or in (20 + 21)+ If we omit all symbol 2 in w, the new infinite word should also be pseudo- 
square-free. But the new word is over {0, 1} and every binary infinite word contains pseudo 
squares, which is a contradiction. So there is no pseudo-square-free infinite word over 3 
letters (with respect to a transposition composed with the mirror image). □ 

There is another way to show the non-existence of pseudo-square-free infinite word over 3 
letters with respect to (0, 1). We use computer to find the longest pseudo-square-free word, 
if any. Starting from empty word e, if a word is pseudo-square-free, then we extend the word 
by adding a new letter at the end; otherwise, we do back-tracking and try the next letter. 
In other words, we do a depth-first-search in a labeled tree, where each node presents a finite 
word. Such tree is called trie and application of similar technique has been appeared in the 
literatures (for example, see [IH]). In the case of 3 letters and 9 = (0, 1) - Mir, the tree is finite 
and all pseudo-square-free words are enumerated. There are in total 91 nodes, including 61 
leaves. The tree is of depth 8 and one of the longest pseudo-square-free words is 0212021. 

Theorem 7. There is no pseudo-square-free infinite word over k letters for k < 2, and 
there is a pseudo-square-free infinite word over k letters for k > 5, with respect to arbitrary 
antimorphic involution. For 3 < k < 4, the existence of pseudo-square-free infinite words 
depends on the antimorphic involution. 

Proof. (1) Since 3 is the minimal size of alphabet over which there is a square-free infinite 
word, by Lemma [31 there is no pseudo-square-free infinite word over k letters for k < 2 and 
there is pseudo-square-free infinite word with respect to 9 for Trn(#) > 3. 

(2) Since there exists an abelian-square-free infinite word over 4 letters, by Lemma HI 
there is a pseudo-square-free infinite word over 7 or more letters. 

(3) By Proposition EJ over 3 letters, there is a pseudo-square-free infinite word with 
respect to mirror image, where | Idt(Mir) | = 3, | Trn(Mir) | = 0, and there is no pseudo- 
square- free infinite word with respect to other antimorphic involution. So by Lemma [5j there 
is a pseudo-square-free infinite word with respect to 9 such that | Idt(#) | + | Trn(#) | > 3. 
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Table 1: The existence of pseudo-square-free infinite words 



|E| 


1 


2 


3 


4 


5 


6 


7 


8 


Trn(#) 


= 


Xl 


Xl 


Vs 




Vs 




y/2,3 


^2,3 


Trn(fl) 


= 1 




Xl 


x 3 


Vs 


Vs 


Vs 


^2,3 


^2,3 


Trn(#) 


= 2 








open 






^2,3 


^2,3 


Trn(#) 


= 3 












y/1,3 


V 1,2,3 


V 1,2,3 


Trn(#) 


= 4 
















\A,2,3 



The result is summarized in Table [T], where the subscription presents which situation of 
the (1), (2), (3) the case falls in. □ 

It is still a open problem that whether there exists a pseudo-square-free infinite word 
over 4 letters with respect to antimorphic involution 0, where Trn(0) = 2. Experimental 
computation shows that there are long pseudo-square-free words in the setting, but we don't 
have proof of the existence of arbitrarily long pseudo-square-free words. But if there are, 
they must satisfy certain conditions as in the following proposition. 

Proposition 8. w is an pseudo-square-free infinite word over 4 letters with respect to 0, 
where Trn(0) = 2, 0(a) = 6, 0(c) = d, a, b,c,d G {1, 2, 3, 4} ; if and only if w G ((a + o)(c + 
d)Y + (( c + d)(a + 6)) w and w is square free. 

Proof. "=>". Since w is pseudo-square-free, w is square free. Furthermore, any word in 
(a + b) 2 + (c + d) 2 is a pseudo square, so the letters in w must appear alternatively from 
{a, 6} and {c, d}. Hence w G ((a + 6)(c + d)) w + ((c + d)(a + 6)) w . 

"••<=". Suppose u> contains a pseudo square. Since w is square free, it must be the case 
that w = ux<f)(x)u. By the definition of antimorphism involution, the last letter of x and the 
first letter of 4>(x) are either both from {a, 6} or both from {c, d}, which contradicts the fact 
w G ((a + 6)(c + d)) w + ((c + d)(a + 6)) w . □ 

2.2 Other pseudo-power-free infinite words 

Now we consider pseudo-cube-free infinite words. By Dekking's construction of abelian-cube- 
free infinite words, there exist pseudo-cube-free infinite words over 3 letters with respect to 
the mirror image. The case over unary alphabet is trivial. For binary alphabet, we have the 
following result. 

Proposition 9. Pseudo-cube-free infinite word does not exist over binary alphabet with any 
antimorphic involution. 

Proof. There are two kinds of antimorphic involutions over binary alphabet: we have either 
9 = Mir or 9 = (0, 1) ■ Mir. 

Suppose 9 = Mir. Again, we use computer to find the longest pseudo-cube-free word, if 
any. Starting from empty word e, if a word is pseudo-cube-free, then we extend the word by 
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appending 0; otherwise, we do back-tracking and try the next letter. The resulted depth- 
first-search tree is finite. There are in total 171 nodes, including 86 leaves. The tree is of 
depth 10 and one of the longest pseudo-cube-free words is 001101100. 

Suppose 9 = (0, 1) • Mir. Similarly, we verified by computer that there are only finitely 
many pseudo-cube-free words. There are in total 15 nodes, including 8 leaves. The tree is 
of depth 3 and one of the longest pseudo-cube-free words is 00. In fact, any word in this 
setting is a pseudo power. □ 

Proposition 10. There is a pseudo-cube-free infinite word over 3 letters with any antimor- 
phic involution. 

Proof. There are two kinds of antimorphic involutions over 3 letters: 9 is either the mirror 
image or a transposition composed with the mirror image. 

Suppose 9 is the mirror image. The following morphism <i 3 given by Dekking [6] presents 
an abelian-cube-free infinite word d% (0) over 3 letters, which is also pseudo-cube-free: 

4(0) =0012, 

d 3 (l) =H2, 
da (2) =022. 

So there is a pseudo-cube-free infinite word d% (0) over 3 letters with respect to mirror image. 

Now suppose 9 is a transposition composed with the mirror image. Without loss of 
generality, we assume 9 = (0, 1) ■ Mir. Consider the following morphism: 

t(0) =021, 
*(1) =120, 
t(2) =2. 

One can verify that the word z = tf"(0) = 02121202120202121202021 • • • is the Thue-Morse 
sequence [21] with letter 2 inserted between every two consecutive letters. Now we prove 
that z is pseudo-cube-free. Suppose z = xw\W2Wzy contains a pseudo-cube W1W2W3 with 
I Wi I = I W2 I = 1 103 |- Either the last letter of W\ or the first letter of W2 is 2, but not both. 
Since 9(2) = 2, we have ui\ 7^ 6(102) ■ So W\ = W2- By the same reason, W2 = W3. Then the 
length of Wi = W2 = W3 must be even. Otherwise, either the first letter of w\ or the first 
letter of W2 is 2, but not both, and thus we have w\ ^ W2- Now since | w± | = | W2 \ = \ W3 | 
is even, we can omit the letter 2 from each word and get new words w[,w 2 ,w' 3 such that 
w[ = w' 2 = w' 3 and w^w^w'^ is a factor of the Thue-Morse sequence, which contradicts the 
fact that Thue-Morse sequence is cube-free. Therefore, z = t w (0) is pseudo- cube-free with 
respect to (0, 1) • Mir over 3 letters. □ 

Theorem 11. There is no pseudo- cube-free infinite word over k letters for k < 2, and 
there is a pseudo -cube- free infinite word over k letters for k > 3, with respect to arbitrary 
antimorphic involution. 
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Table 2: The existence of pseudo-cube-free infinite words 



|E| 


1 


2 


3 


4 


5 


6 


7 


8 


Trn(#) 


= 


Xl 


x 3 


Vs 


Vs 


V^.3 


^2,3 


^2,3 


^2,3 


Trn(fl) 


= 1 




x 3 


Vs 


Vs 


^2,3 


^2,3 


^2,3 


^2,3 


Trn(#) 


= 2 










\/l,2,3 


V 1,2,3 


V 1,2,3 


V 1,2,3 


Trn(#) 


= 3 












\A,2,3 


V 1,2,3 


V 1,2,3 


Trn(#) 


= 4 
















\A,2,3 



Proof. (1) Since 2 is the minimal size of alphabet over which there is a cube-free infinite 
word, by Lemma [31 there is no pseudo-cube-free infinite word over k letters for k < 1 and 
there is pseudo-cube-free infinite word with respect to 9 for Trn(#) > 2. 

(2) There is an abelian-cube-free infinite word d^(0) over 3 letters. By Lemma HJ there 
is a pseudo-cube-free infinite word over 5 or more letters. 

(3) By Proposition [9j there is no pseudo-cube-free infinite word over binary alphabet. 
By Proposition [TO, there is a pseudo-cube-free infinite word over 3 letters. In particular, 
there is a pseudo-cube-free infinite word over 3 letters with respect to mirror image, where 
| Idt(Mir) | = 3, | Trn(Mir) | = 0. So by Lemma [51 there is a pseudo-cube-free infinite word 
with respect to 6 such that | ldt(0) | + | Trn(6 l ) | > 3. 

The result is summarized in Table [2], where the subscription presents which situation of 
the (1), (2), (3) the case falls in. □ 

We now discuss other pseudo-fcth-power-free infinite words with k > 4. Every word over 
a single letter is a power. So the unary case is trivial and no X-free infinite word exists for 
X being either fcth-power, or abelian-/cth-power, or pseudo-fcth-power. 

Theorem 12. For any integer k > A, there is no pseudo-kth-power-free infinite word over 
I letters for I < 1, and there is a pseudo-kth-power-free infinite word over I letters for 
I > 3, with respect to arbitrary antimorphic involution. For binary alphabet, the existence of 
pseudo-kth-power-free infinite words depends on the antimorphic involution. 

Proof. (1) There is no pseudo-power- free infinite words over unary alphabet. Since there 
is a fcth-power-free infinite word over binary alphabet, by Lemma [31 there is a pseudo-fcth- 
power-free infinite word with respect to 9 for Trn(6 l ) > 2. 

(2) There exists abelian-4th-power-free infinite word over binary alphabet, such as the 
following construction by Dekking [6] w = d%(0) where 

d 4 (0) =011, 
d 4 (l) =0001. 

So there exists an abelian-fcth-power-free infinite word w over binary alphabet for any integer 
k > 4. By Lemma HI there is a pseudo-fcth-power-free infinite word over 3 or more letters. 

(3) That infinite word w = d%(0) is also a pseudo-fcth-power-free infinite word for k > 4 
over binary alphabet with respect to the mirror image, where | Idt(Mir) | = 2, | Trn(Mir) | = 
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Table 3: The existence of pseudo-fcth-power-free infinite words for integer k > 4 



|E] 


1 


2 


3 


4 


5 


6 


7 


8 


Trn(#) 


= 


Xl 






^2,3 


V2,3 


^2,3 


^2,3 


^2,3 


Trn(#) 


= 1 




x 3 


^2,3 


^2,3 


^2,3 


^2,3 


^2,3 


^2,3 


Trn(#) 


= 2 








Vl,2,3 


\/l,2,3 


V 1,2,3 


V 1,2,3 


V 1,2,3 


Trn(#) 


= 3 












Vl,2,3 


V 1,2,3 


V 1,2,3 


Trn(#) 


= 4 
















Vl,2,3 



0. So by Lemma EJ there is a pseudo-fcth-power-free infinite word for k > 4 with respect to 
# such that I ldt(0) | + | Trn(#) | > 2. If 9 = (0, 1) • Mir over binary alphabet, then any word 
is a pseudo power. 

The result is summarized in Table El where the subscription presents which situation of 
the (1), (2), (3) the case falls in. □ 



3 Testing pseudo-power-freeness of words 

In this section, we will discuss the following problem: given a finite word w and integer 
k > 2, does w contain any pseudo- /cth-power as a factor? First, we will discuss the general 
algorithm for arbitrary k. 

3.1 General algorithm for arbitrary kth. pseudo-power 

The naive algorithm runs in 0(N 3 ) time to decide whether w contains any pseudo-/cth-power 
as a factor. The idea is that we check each possible candidate factors u of w to see whether 
u is a pseudo-fcth-power. There are 0(N 2 ) factors and check whether a word is a pseudo 
kth power can be done with O(N) comparisons of letters. 

Here we describe an 0(n 2 lgn)-time algorithm to decide weather an input string w of 
length n contains a &;-th pseudo-power of a word or not. Our algorithm has three steps: in 
the first step, it constructs an n x n x logn x 2 zero-one matrix A such that 

A i>j;ki0 = 1 iff w[i .. 1 + 2 k - 1] = w[j ..j + 2 k - 1], 
A itjA1 = 1 iff w[i .. i + 2 k - 1] = (f>{w[j .. j + 2 k - 1]). 

Then, using A, the algorithm constructs a set of binary strings 

Si\ — n — 2i + 1, Si[£] = 1 iff w[£ .. £ + 2i — 1] is a pseudo-square}. 
Having Sj's, it is easy to find a pseudo- fcth-power, if there exists any. 

Lemma 13. Given {s{ : 1 < i < \n/2\} and k as inputs, there is an algorithm with time 
linear to ^2 |sj] that finds all pseudo-kth-powers in w. 



{Si : 1 < i < 



n 
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Proof. In linear time, for all 1 < i < \n/2\, we will break the string Sj into i strings 
s i,i, s i,2, ■ ■ ■ , -Sj,i such that s it j consists of the characters at positions j, j + i, j + 2i, . . . in Sj. 
This can be done trivially in linear-time. 

Now, observe that there is a pseudo-fcth-power in w starting at position x of length k x y 
if and only if s^-i mo d y)+i has k — 1 consecutive Is starting at position \x/y\. □ 

Our method is summarized in Algorithm [TJ 



Algorithm 1: PSEUDO-PoWER-FREENESS(w, k) 
Initial A to a zero matrix; 
for all i and j such that 1 < i, j < n do 
if w[i] = w[j] then Ajj )0i o <— 1; 
if w[i) = (j>(w[j)) then A iJflA <- 1; 
end 

1 for = 1 . . . |lg n\ do 

for all i and j such that 1 < i, j, < n — 2 k + 1 do 

if Ai,j,k-i,o = 1 ^d A+2 fe - 1 ,j+2 fe - 1 ,fc-i,o = 1 then A i:jjk:0 <— 1; 
if A,j+2 fc - 1 ,fc-i,i — 1 an d A+2 fc - 1 ,j,fc-i,o — 1 then Ai^i ^~ 1; 

end 

2 for a// i swc/i that 1 < i < \n/2\ do 

for 1 < I < n - 2% do 

Let ii, i 2 , ■ ■ ■ , ii be distinct integers such that i = J2l=i ^'i 

if A e T y-i 2lx e+i+y <v-i 2 i x i = 1 for all y = 1,2, ... ,1 then <- 1; 

if A^ +r „-i 2ia; <+ai _ r » 2*» .„ i = 1 f or ally = 1,2,..., I then Si[£] *- 1; 

1 'a; — 1 ' 1 'a; — 1 ' f 1 

end 

3 for all i such that 1 < % < \n/2\ do 

Break Sj into i strings s^i, s^, . . . , Si,i as described in Lemma [TBI 
if i/iere exists 1 < j ; < % such that Sij contains k — 1 consecutive Is then return 
_ NO; 
return YES; 



Theorem 14. Algorithm^ runs in time 0(n 2 lgn) and returns YES if and only if w does 
not have any pseudo-kth-power as a substring. 

Proof. The running time of block 1 and 2 is 0(n 2 \gn) and block 3 runs in 0(n 2 ) as 
explained in Lemma [TBI 

As for the correctness, it is enough to show that, after block 1 and 2, the matrix A and 
the set of strings {sj : 1 < i < \n/2\} have the values that the are supposed to have; i.e. 

A iJA0 = l iff w[i. . i + 2 k -l] = w[j..j + 2 k -l), (3) 
A hjA1 = l iff w[i.. i + 2 k -l] = cf>(w[j..j + 2 k -l}), (4) 
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and 



Vz, t s.t. 1 < £ < i : Si[£] = 1 iff w[£ .. £ + 2% — 1] is a pseudo-square. (5) 

To prove that (1) holds, we use induction on k. For k = (1) holds because of the 
initialization before block 1. Assuming that (1) holds for k = 0,1, ... ,k', it is easy to 
see that (1) holds for k! + 1: u>[i. . .i + 2 h ' +1 - 1] = lofj? . . . j + 2 fc ' +1 - 1] if and only if 
w[i...i + 2 k> -1] = w[j ...j + 2 k ' -1] andw[i + 2 k> . . .i + 2 k ' +1 -l] = w[j + 2 k ' . . .j + 2 fc ' +1 -l]. 

Proving (2) is similar to proving (1). 

For (3), note that 

1. w[£ ..£ + 2% — 1] is a pseudo-square if and only if w[£ .. £ + % — 1] = w[£ + i .. £ + 2i — 1] 
orw[£..£ + i-l) = <f)(w[£ + i..£ + 2i-l)). 

2. w[£..£ + i-l]=w[£ + i..£ + 2i- 1] if and only if w[£ + J2lZ\ ^ ~Z + ELi 2 ^ ~ l \ = 
w[£ + i + YZZ\ $"..£ + i + ELi 2ix - !] for all y = 1, 2, . . . , J, where h, i 2 ,-..,ii are 
distinct integers such that i = X^i 2 * 1 - 

3. w[£..£+i-l] = cf)(w[£+i..£+2i-l]) if and only if w[£+J2lZ\^ ..^+ELi 2 ^- 1 ] = 
(f)(w[£ + 2i- YZ=x 2 ix ..£ + 2i- YZ=i 2 ix - 1]) for all y = 1, 2, . . . , I, where i x , i 2 ,...,ii 
are distinct integers such that % = Y, x =i 2ix - D 

In the following subsection, we consider, for fixed small k, whether a given word w is 
pseudo-fcth-power-free. 

3.2 Testing pseudo-square- freeness 

Theorem 15. To decide whether a word w contains a pseudo-square as a factor can be done 
in linear time. 

Proof. Let N = \ w\ . A word w contains a pseudo-square if and only if w contains a square 
or a word of the form w<p(w). 

To check whether w contains a square can be done in linear time. There are a few works 
in the literatures on testing square- freeness in linear time jH [T5] . 

To check whether w contains a word of the form u<ft(u), it is enough to check whether 
w contains a word a<f)(a) for a letter a. To see this, if w contains u<j)(u), then let a be the 
right-most letter of u and w contains a<p(a); for the other direction, word a<p(a) itself is a 
pseudo-square. 

The algorithm is illustrated in Algorithm [2j It is obvious that the algorithm is linear. □ 

3.3 Testing pseudo-cube-freeness 

Before we show a cubic time algorithm for the pseudo-cube-freeness of a word, we first intro- 
duce some concepts. Let w = w[l .. N] be a finite word over S and let <fr be an antimorphic 
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Algorithm 2: Decide whether w is pseudo-square-free in linear time 
Input: a word w = w[l .. N]. 

Output: "YES" if w is pseudo-square-free; "NO" otherwise. 

1 if w contains a square then return "NO"; 

2 for i from 1 to iV — 1 do 

3 [ if cj)(w[i}) = w[i + 1] then return "NO"; 

4 end 

s return "YES"; 



involution with the same alphabet X. A right minimal periodic rmp w [l .. N] of w is a vector 
and is defined by 

rmp w [i] = min {min{ra : w[i .. i + 2n — 1] = x 2 for some x ^ e}, +00} , 

and similarly a left minimal periodic lmp w [l .. N] of w is defined by 

Zmp^fz] = min |min{n : w[i — 2n + 1 ..i] = x 2 for some x 7^ e}, +00} . 

For example, when w = 01001010, we have rmp w = [3, +00, 1, 2, 2, +00, +00, +00] and 
lmp w = [+00, +00, +00, 1, +00, 3, 2, +00]. A centralized maximal pseudo-palindrome cmp w [0 .. N] 
of w (with respect to 0) is a vector and is defined by 

cmp w [i] = max {max{m : <p{w[i — m.. i — 1]) = w[i .. i + m — 1]}, 0} . 

For example, when w = 01001010 and = Mir, we have cmp w = [0, 0, 0, 3, 0, 0, 0, 0, 0]. The 
left-most and right-most elements of cmp w are always 0. 

Lemma 16. For any fixed integer k, the right (respectively, left) minimal periodic rmp w 
(respectively, lmp w ) of word w can be computed in linear time 0(\w\). 

There is an algorithm to compute rmp w , the shortest square starting at each position, in 
linear time [13] by using suffix tree. Since vector lmp w can be obtained by first computing 
V = rmpMir(w) and then reversing V, vector lmp w can also be computed in linear time. 

Lemma 17. The centralized maximal pseudo-palindrome cmp w can be computed in linear 
time 0(| w |). 

Lemma [T71 has been proved in the book P, page 197-198], which claimed all the maximal 
even-length palindromes can be found in linear time. 

Now we are ready to show a quadratic time algorithm to test the pseudo-cube-freeness 
of a given word w of length N. By definition, a pseudo-cube is in one of the following form 
xxx, xx(f>(x), (p(x)xx, and xcp(x)x. In order to check whether w contains any pseudo-cube, 
we check each of the four cases. 

To check whether w contains any word of the form xxx can be done in linear time. Word 
w contains a cube if and only if one of the maximal repetition in w has exponent > 3, and 
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there is linear algorithm [12] to find all the maximal repetitions. So this case can be checked 
in O(N) time. 

To check whether w contains any word of the form xx<p(x), we check whether there is a 
pair of factors w[i — 2n + 1 .. i] = yy and w[i — m + 1 .. i + m] = z(j){z) that overlap in the 
sense that n < m. By the definitions of lmp w and cmp w , we only need to check for each 
position i whether lmp w [i] < cmp w [i]. this can be done in O(N) time when all lmp w ,cmp w 
are already computed. The case for (p(x)xx is similar. 

To check whether w contains any word of the form x<j)(x)x, we check whether there is a 
pair of factors w[i — n + 1 .. i + n\ = y<fi{y) and w[j — m + 1 .. j + m] = z<fi(z) that overlap in 
the sense that | % — j \ < n and | % — j \ < m. By the definition of cmp w , we check for each 
pair of indices i, j with i < j whether j — i < cmp w [i} and j — i < cmp w [j]. This can be done 
in 0(N 2 ) time when cmp w is already known. 



Algorithm 3: Decide whether w is pseudo-cube-free in linear time 
Input: a word w = w[l .. N]. 

Output: "YES" if w is pseudo-cube-free; "NO" otherwise. 

1 compute rmp w , lmp w , cmp w ; 

2 if w contains a cube then return "NO"; // The case xxx 

3 for i from 1 to iV do 



if rmp w [i] < cmp w [i — 1] then return "NO"; // The case <p(x)xx 
if lmp w [i] < cmp w [i] then return "NO"; // The case xx<p{x) 
for d from 1 to cmp w [i] do 
| if d < cmp w [i + d] then return "NO"; II The case X(p(x)x 
end 



4 
5 
6 
7 

8 

9 end 

10 return "YES 



The completed algorithm is given in Algorithm [3j So we have the following theorem. 

Theorem 18. To decide whether a word w contains a pseudo-cube as a factor can be done 
in quadratic time. 

Proof. Let N = \w\. Algorithm [3] checks the pseudo-cube-freeness of w in quadratic time. 
By Lemma [T6l and Lemma [T71 the computation of rmp w ,lmp w ,cmp w in line 1 can be done 
in O(N) times. Line 2 can be done in O(N) time. Line 3-10 can be done in 0(N 2 ) time. 
So the algorithm runs in 0(N 2 ) time. 

Now we prove the correctness of the algorithm. First, we prove that if the algorithm 
returns "NO", then w contains a pseudo cube. If the algorithm return at line 2, then w 
contains a cube of the form xxx, which is also a pseudo cube. Suppose the algorithm return 
at line 4. Let n = rmp w [i] and m = cmp w [i — 1]. Then n < m and the word w[i— n .. i + 2n — 1] 
is of the form (p(x)xx, which is a pseudo cube. Suppose the algorithm return at line 5. Let 
n = lmp w [i] and m = cmp w [i] . Then n < m and the word w [i — 2n + 1 . . i + n] is of the form 
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xx(j)(x), which is a pseudo cube. Suppose the algorithm return at line 7. Then the word 
w[i — d+1 + 2d] is of the form xcf>(x)x, which is a pseudo cube. 

Now, we prove that if w contains a pseudo cube, then the algorithm returns "NO". If 
w contains a pseudo cube of the form xxx, then the algorithm returns at line 2. Suppose 
w[s .. s + 3p — 1] = xxcj)(x). Then q = lmp w [s + 2p— 1] < | x | and cmp w [s + 2p— 1] > | x \ > q. 
So the algorithm returns at line 5 for i — s + 2p — 1, (although the detected pseudo cube 
w[s+2p— 2q .. s+2p+q— 1] may be different from w[s .. s+3p— 1].) The case w[s .. s+3p— 1] = 
4>(x)xx is similar. Suppose w[s .. s + 3p — 1] = x<p(x)x. Then cmp w [s + p — 1] > | x | and 
c^P™ [s + 2p — 1] > | x | . So the algorithm returns at line 7 for i = s + p — 1 and d = p. □ 

4 Conclusion 

In this paper, we discussed the existence of infinite words that do not contain pseudo-A;th- 
power. For alphabet size < 2, there is no pseudo-square- free infinite words and for alphabet 
size > 5, there exist pseudo-square-free infinite words. For other alphabet size, the existence 
of pseudo-square-free infinite words depends on the antimorphic involution <p. For alphabet 
size < 2, there is no pseudo-cube-free infinite words and for alphabet size > 3, there exist 
pseudo-cube-free infinite words. For alphabet size > 3, there exist pseudo- /cth-power-free 
infinite words for any integer k > 4. For binary alphabet and any integer k > 4, the 
existence of pseudo-/cth-power-free infinite words depends on the antimorphic involution <fi. 

We also discussed the algorithm for testing whether a given word w is pseudo-/cth-power- 
free. For arbitrary kth pseudo-power, there is a 0{n 2 lgn)-time algorithm to find all kth 
pseudo-power in w, where n = \w\. For k = 2, there is a 0(n)-time algorithm for testing 
pseudo-square-freeness of word w of length n. For k = 3, there is a 0(n 2 )-time algorithm 
for testing pseudo-cube-freeness of word w of length n. 
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